ReproHack Hub

Browse ReproHack papers

Living HTA: Automating Health Technology Assessment with R

Authors: Robert A. Smith, Paul P. Schneider, Wael Mohammed

DOI: 10.12688/wellcomeopenres.17933.1

Submitted by rasmith3

Why should we attempt to reproduce this paper?
We think this is an interesting paper for anyone who wants to learn to build an API with the R package plumber. This is a novel method in health economics, and we believe it will help improve the transparency of modelling methods in our field.

Tags: R Shiny Health Economics HTA plumber

Accelerating the prediction of large carbon clusters via structure search: Evaluation of machine-learning and classical potentials

Authors: Bora Karasulu, Jean-Marc Leyssale, Patrick Rowe, Cedric Weber, Carla de Tomas

DOI: 10.1016/j.carbon.2022.01.031

Submitted by bkarasulu
Number of reviews: 1
Why should we attempt to reproduce this paper?
This paper presents a fine example of a high-throughput computational materials screening study, focusing mainly on carbon nanoclusters of different sizes. In the paper, a diverse set of empirical and machine-learned interatomic potentials, which are commonly used to simulate carbonaceous materials, is benchmarked against higher-level density functional theory (DFT) data, using a range of structural features as the comparison criteria. Trying to reproduce the data presented here (even if you only consider a subset of the interatomic potentials) will help you develop an understanding of how you could approach a high-throughput structure prediction problem. Even though we concentrate here on isolated/finite nanoclusters, AIRSS (and similar approaches such as USPEX, CALYPSO, GMIN, etc.) can also be used to predict crystal structures of different classes of materials with applications in energy storage, catalysis, hydrogen storage, and so on.

Tags: Python HPC LAMMPS DFT interatomic potentials Python scripting AIRSS structure prediction density functional theory high-throughput machine-learning

Finding Efficient Trade-offs in Multi-Fidelity Response Surface Modeling

Authors: Sander van Rijn, Sebastian Schmitt, Matthijs van Leeuwen, Thomas Bäck

Submitted by sjvrijn
Mean reproducibility score: 9.0/10 | Number of reviews: 1
Why should we attempt to reproduce this paper?
Because:
- Two fellow PhDs working on different topics have been able to reproduce some figures by following the README instructions and I hope this extends to other people
- I've tried to incorporate as many of the best practices as possible to make my code and data open and accessible
- I've tried to make sure that my data is exactly reproducible with the specified random seed strategy
- the paper suggests a method that should be useful to other researchers in my field, which is not useful unless my results are reproducible

Tags: Python HPC Computer Science

Where should new parkrun events be located? Modelling the potential impact of 200 new events on socio-economic inequalities in access and participation

Authors: Schneider PP, Smith RA, Bullas AM, Bayley T, Haake SS, Brennan A, Goyder E

Submitted by hub-admin
Mean reproducibility score: 7.0/10 | Number of reviews: 3
Why should we attempt to reproduce this paper?
If all went right, the analysis should be fully reproducible without the need to make any adjustments. The paper aims to find optimal locations for new parkruns, but we were not 100% sure how 'optimal' should be defined. We provide a few examples, but the code was meant to be flexible enough to allow potential decision makers to specify their own, alternative objectives. The spatial data set is also quite interesting and fun to play around with. Caveat: the full analysis takes a while to run (~30+ min) and might require >= 8 GB of RAM.

Tags: R GDAL GEOS GIS Shiny PROJ

Algorithm configuration data mining for CMA evolution strategies

Authors: Sander van Rijn, Hao Wang, Bas van Stein, Thomas Bäck

DOI: 10.1145/3071178.3071205

Submitted by sjvrijn
Mean reproducibility score: 10.0/10 | Number of reviews: 1
Why should we attempt to reproduce this paper?
The original data took quite a while to produce for a previous paper, but for this paper, all tables and figures should be exactly reproducible by simply running the Jupyter notebook.

Tags: Python HPC Computer Science

Open Trade Statistics

Authors: Pachá (Mauricio Vargas Sepúlveda)

Submitted by hub-admin

Why should we attempt to reproduce this paper?
The focus of the project is reproducibility. Here we show how accessing the data differs from similar initiatives: https://ropensci.org/blog/2019/05/09/tradestatistics/. Also, similar projects have obscure parts, while ours exposes the code from raw data downloading to dashboard creation.

Tags: R Shiny

Spatial modelling of rice yield losses in Tanzania due to bacterial leaf blight and leaf blast in a changing climate

Authors: C. Duku, A. H. Sparks, S. J. Zwart.

DOI: 10.1007/s10584-015-1580-2

Submitted by hub-admin
Mean reproducibility score: 4.0/10 | Number of reviews: 2
Why should we attempt to reproduce this paper?
This was my third attempt at making a paper fully reproducible. To date, it's the most reproducible paper that I have published. I'm interested to know what stumbling blocks exist that I'm not aware of (aside from needing software like ArcGIS to fully rerun the complete analysis).

Tags: Python R ArcGIS
